1 CGACCTGGCC GCCGGCCGCT CCTCCGCGCG CTGTTCCGCA CTTGCTGCCC
51 TCGCCCGGCC CGGAGCGCCG CTGCCATGCG GCTGGCGCTG CTCTGGGCCC 
101 TGGGGCTCCT GGGCGCGGGC AGCCCTCTGC CTTCCTGGCC GCTCCCAAAT 
151 ATAGCCCTGC TGTCGATTCC CTCAGTACTG TCTTGGGGTG TCCTGGGACC 
201 TGCAGGTGGC ACTGAGGAGC AGCAGGCAGA GTCAGAGAAG GCCCCGAGGG 
251 AGCCCTTGGA GCCCCAGGTC CTTCAGGACG ATCTCCCAAT TAGCCTCAAA 
301 AAGGTGCTTC AGACCAGTCT GCCTGAGCCC CTGAGGATCA AGTTGGAGCT 
351 GGACGGTGAC AGTCATATCC TGGAGCTGCT ACAGAATAGG GAGTTGGTCC 
401 CAGGCCGCCC AACCCTGGTG TGGTACCAGC CCGATGGCAC TCGGGTGGTC 
451 AGTGAGGGAC ACACT7TGGA GAACTGCTGC TACCAGGGAA GAGTGCGGGG 
501 ATATGCAGGC TCCTGGGTGT CCATCTGCAC CTGCTCTGGG CTCAGAGGCT 
551 TGGTGGTCCT GACCCCAGAG AGAAGCTATA CCCTGGAGCA GGGGCCTGGG 
601 GACCTTCAGG GTCCTCCCAT TATTTCGCGA ATCCAAGATC TCCACCTGCC 
651 AGGCCACACC TGTGCCCTGA GCTGGCGGGA ATCTGTACAC ACTCAGACGC 
701 CACCAGAGCA CCCCCTGGGA CAGCGCCACA TTCGCCGGAG GCGGGATGTG 
751 GTAACAGAGA CCAAGACTGT GGAGTTGGTG ATTGTGGCTG ATCACTCGGA 
801 GGCCCAGAAA TACCGGGACT TCCAGCACCT GCTAAACCGC ACACTGGAAG 
851 TGGCCCTCTT GCTGGACACA TTCTTCCGGC CCCTGAATGT ACGAGTGGCA 
901 CTAGTGGGCC TGGAGGCCTG GACCCAGCGT GACCTGGTGG AGATCAGCCC 
951 AAACCCAGCT GTCACCCTCG AAAACTTCCT CCACTGGCGC AGGGCACATT 
1001 TGCTGCCTCG ATTGCCCCAT GACAGTGCCC AGCTGGTGAC TGGTACTTCA 
1051 TTCTCTGGGC CTACGGTGGG CATGGCCATT CAGAACTCCA TCTGTTCTCC 
1101 TGACTTCTCA GGAGGTGTGA ACATGGACCA CTCCACCAGC ATCCTGGGAG 
1151 TCGCCTCCTC CATAGCCCAT GAGTTGGGCC ACAGCCTGGG CCTGGACCAT 
1201 GATTTGCCTG GGAATAGCTG CCCCTGTCCA GGTCCAGCCC CAGCCAAGAC 
1251 CTGCATCATG GAGGCCTCCA CAGACTTCCT ACCAGGCCTG AACTTCAGCA 
1301 ACTGCAGCCG ACGGGCCCTG GAGAAAGCCC TCCTGGATGG AATGGGCAGC 
1351 TGCCTCTTCG AACGGCTGCC TAGCCTACCC CCTATGGCTG CTTTCTGCGG 
1401 AAATATGTTT GTGGAGCCGG GCGAGCAGTG TGACTGTGGC TTCCTGGATG 
1451 ACTGCGTCGA TCCCTGCTGT GATTCTTTGA CCTGCCAGCT GAGGCCAGGT 
1501 GCACAGTGTG CATCTGACGG ACCCTGTTGT CAAAATTGCC AGCTGCGCCC 
1551 GTCTGGCTGG CAGTGTCGTC CTACCAGAGG GGATTGTGAC TTGCCTGAAT 
1601 TCTGCCCAGG AGACAGCTCC CAGTGTCCCC CTGATGTCAG CCTAGGGGAT 
1651 GGCGAGCCCT GCGCTGGCGG GCAAGCTGTG TGCATGCACG GGCGTTGTGC 
1701 CTCCTATGCC CAGCAGTGCC AGTCACTTTG GGGACCTGGA GCCCAGCCCG 
1751 CTGCGCCACT TTGCCTCCAG ACAGCTAATA CTCGGGGAAA TGCTTTTGGG 
1801 AGCTGTGGGC GCAACCCCAG TGGCAGTTAT GTGTCCTGCA CCCCTAGAGA 
1851 TGCCATTTGT GGGCAGCTCC AGTGCCAGAC AGGTAGGACC CAGCCTCTGC 
1901 TGGGCTCCAT CCGGGATCTA CTCTGGGAGA CAATAGATGT GAATGGGACT 
1951 GAGCTGAACT GCAGCTGGGT GCACCTGGAC CTGGGCAGTG ATGTGGCCCA 
2001 GCCCCTCCTG ACTCTGCCTG GCACAGCCTG TGGCCCTGGC CTGGTGTGTA 
2051 TAGACCATCG ATGCCAGCGT GTGGATCTCC TGGGGGCACA GGAATGTCGA 
2101 AGCAAATGCC ATGGACATGG GGTCTGTGAC AGCAACAGGC ACTGCTACTG 
2151 TGAGGAGGGC TGGGCACCCC CTGACTGCAC CACTCAGCTC AAAGCAACCA 
2201 GCTCCCTGAC CACAGGGCTG CTCCTCAGCC TCCTGGTCTT ATTGGTCCTG 
2251 GTGATGCTTG GTGCCAGCTA CTGGTACCGT GCCCGCCTGC ACCAGCGACT 
2301 CTGCCAGCTC AAGGGACCCA CCTGCCAGTA CAGGGCAGCC CAATCTGGTC
2351 CCTCTGAACG GCCAGGACCT CCGCAGAGGG CCCTGCTGGC ACGAGGCACT 
2401 AAGGCTAGTG CTCTCAGCTT CCCGGCCCCC CCTTCCAGGC CGCTGCCGCC 
2451 TGACCCTGTG TCCAAGAGAC TCCAGTCTCA GGGGCCAGCC AAGCCCCCAC 
2501 CCCCAAGGAA GCCACTGCCT GCCGACCCCC AGGGCCGGTG CCCATCGGGT 
2551 GACCTGCCCG GCCCAGGGGC TGGAATCCCG CCCCTAGTGG TACCCTCCAG 
2601 ACCAGCGCCA CCGCCTCCGA CAGTGTCCTC GCTCTACCTC TGACCTCTCC 
2651 GGAGGTTCCG CTGCCTCCAA GCCGGACTTA GGGCTTCAAG AGGCGGGCGT 
2701 GCCCTCTGGA GTCCCCTACC ATGACTGAAG GCGCCAGAGA CTGGCGGTGT 
2751 CTTAAGACTC CGGGCACCGC CACGCGCTGT CAAGCAACAC TCTGCGGACC 
2801 TGCCGGCGTA GTTGCAGCGG GGGCTTGGGG AGGGGCTGGG GGTTGGACGG 
2851 GATTGAGGAA GGTCCGCACA GCCTGTCTCT GCTCAGTTGC AATAAACGTG 
2901 ACATCTTGGA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 
2951 AAAAAAAAAA AAAAAAAA 
(SEQ ID NO: 1) 
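
The listing above follows the usual layout for patent sequence figures: each row begins with the position of its first base, followed by blocks of ten residues. The sketch below, assuming that layout holds for every row, shows one way to join such rows back into a single continuous string. The function name `parse_listing` is hypothetical, and only the first two rows of SEQ ID NO: 1 are used as input.

```python
def parse_listing(text: str) -> str:
    """Join a numbered, ten-residue-block sequence listing into one string."""
    residues = []
    for line in text.splitlines():
        parts = line.split()
        # Drop the leading position index (e.g. "51") when the row has one.
        if parts and parts[0].isdigit():
            parts = parts[1:]
        residues.append("".join(parts))
    return "".join(residues)


# First two rows of SEQ ID NO: 1, copied from the listing.
listing = """1 CGACCTGGCC GCCGGCCGCT CCTCCGCGCG CTGTTCCGCA CTTGCTGCCC
51 TCGCCCGGCC CGGAGCGCCG CTGCCATGCG GCTGGCGCTG CTCTGGGCCC"""

seq = parse_listing(listing)
# Position 76 (1-based) is index 75 (0-based).
start_codon = seq[75:78]
```

With these two rows, `start_codon` evaluates to `ATG`, consistent with the start codon position given for this sequence.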
FEATURES:

5'UTR: 1-75 

Start Codon: 76 

Stop Codon: 2641 

3'UTR: 2644 
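
The coordinates above imply an open reading frame of (2641 - 76) / 3 = 855 codons, with the 3'UTR beginning directly after the three-base stop codon. The sketch below checks that arithmetic and translates the first 30 coding bases (positions 76-105 of SEQ ID NO: 1). It assumes the standard genetic code; `cds_codons` and `translate` are hypothetical helper names introduced only for illustration.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def cds_codons(start: int, stop: int) -> int:
    """Number of sense codons between the start codon and the stop codon."""
    span = stop - start
    if span % 3:
        raise ValueError("start and stop codons are not in the same frame")
    return span // 3


def translate(dna: str) -> str:
    """Translate in frame 1 until the first stop codon or the end of input."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


codons = cds_codons(76, 2641)
# Bases 76-105 of SEQ ID NO: 1, copied from the listing.
peptide = translate("ATGCGGCTGGCGCTGCTCTGGGCCCTGGGG")
```

Here `codons` is 855 and `peptide` is `MRLALLWALG`, the first ten residues of the encoded protein.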

Homologous proteins: 

Top 10 BLAST Hits: 

                                                                    Score    E
Sequences producing significant alignments:                         (bits)  Value

CRA|335001098640323 /altid=gi|7451525 /def=pir||G02390 disinteg..    1714   0.0
CRA|335001098639998 /altid=gi|11497002 /def=ref|NP_003806.2| a ..    1698   0.0
CRA|1000682348196 /altid=gi|9945328 /def=ref|NP_064704.1| a dis..    1377   0.0
CRA|18000005154484 /altid=gi|6752962 /def=ref|NP_033744.1| a di..    1351   0.0
CRA|1000737073449 /altid=gi|6682839 /def=dbj|BAA88903.1| (AB022..    1319   0.0
CRA|157000140328366 /altid=gi|12720142 /def=ref|XP_010635.1| a ..     970   0.0
CRA|18000005119563 /altid=gi|4501905 /def=ref|NP_003465.1| a di..     539   e-152
CRA|98000043629034 /altid=gi|13027660 /def=gb|AAC08702.2| (AF02..     539   e-152
CRA|18000005009258 /altid=gi|6680640 /def=ref|NP_031426.1| a di..     538   e-151
CRA|98000043606871 /altid=gi|12802370 /def=gb|AAK07852.1|AF3113..     517   e-145


FIGURE 1B 



Docket No.: CL001177DIV2 
Serial No.: (to be assigned) 
Inventors: Gennady MERKULOV et al. 
Title: ISOLATED HUMAN METALLOPROTEASE... 



EST: 

                                                 Score    E
Sequences producing significant alignments:      (bits)  Value

gi|12777372 /dataset=dbest /taxon=960.            1750   0.0
gi|10205626 /dataset=dbest /taxon=96..            1364   0.0
gi|10746030 /dataset=dbest /taxon=96..            1352   0.0
gi|12758166 /dataset=dbest /taxon=960.            1334   0.0
gi|13130161 /dataset=dbest /taxon=960.            1306   0.0
gi|11003698 /dataset=dbest /taxon=96..            1298   0.0
gi|12763891 /dataset=dbest /taxon=960.            1281   0.0
gi|9124688 /dataset=dbest /taxon=9606.            1211   0.0



EXPRESSION INFORMATION FOR MODULATORY USE: 

gi|12777372 placenta
gi|10205626 lung
gi|10746030 ovary
gi|12758166 colon
gi|13130161 kidney
gi|11003698 thyroid gland
gi|12763891 prostate
gi|9124688 eye

Tissue expression: 
leucocyte 
1 MRLALLWALG LLGAGSPLPS WPLPNIALLS IPSVLSWGVL GPAGGTEEQQ
51 AESEKAPREP LEPQVLQDDL PISLKKVLQT SLPEPLRIKL ELDGDSHILE
101 LLQNRELVPG RPTLVWYQPD GTRVVSEGHT LENCCYQGRV RGYAGSWVSI
151 CTCSGLRGLV VLTPERSYTL EQGPGDLQGP PIISRIQDLH LPGHTCALSW
201 RESVHTQTPP EHPLGQRHIR RRRDVVTETK TVELVIVADH SEAQKYRDFQ
251 HLLNRTLEVA LLLDTFFRPL NVRVALVGLE AWTQRDLVEI SPNPAVTLEN
301 FLHWRRAHLL PRLPHDSAQL VTGTSFSGPT VGMAIQNSIC SPDFSGGVNM
351 DHSTSILGVA SSIAHELGHS LGLDHDLPGN SCPCPGPAPA KTCIMEASTD
401 FLPGLNFSNC SRRALEKALL DGMGSCLFER LPSLPPMAAF CGNMFVEPGE
451 QCDCGFLDDC VDPCCDSLTC QLRPGAQCAS DGPCCQNCQL RPSGWQCRPT
501 RGDCDLPEFC PGDSSQCPPD VSLGDGEPCA GGQAVCMHGR CASYAQQCQS
551 LWGPGAQPAA PLCLQTANTR GNAFGSCGRN PSGSYVSCTP RDAICGQLQC
601 QTGRTQPLLG SIRDLLWETI DVNGTELNCS WVHLDLGSDV AQPLLTLPGT
651 ACGPGLVCID HRCQRVDLLG AQECRSKCHG HGVCDSNRHC YCEEGWAPPD
701 CTTQLKATSS LTTGLLLSLL VLLVLVMLGA SYWYRARLHQ RLCQLKGPTC
751 QYRAAQSGPS ERPGPPQRAL LARGTKASAL SFPAPPSRPL PPDPVSKRLQ
801 SQGPAKPPPP RKPLPADPQG RCPSGDLPGP GAGIPPLVVP SRPAPPPPTV
851 SSLYL
(SEQ ID NO: 2)

FEATURES:

Functional domains and key regions:

[1] PDOC00001 PS00001 ASN_GLYCOSYLATION
N-glycosylation site

Number of matches: 5

1 254-257 NRTL

2 406-409 NFSN

3 409-412 NCSR

4 623-626 NGTE

5 628-631 NCSW



[2] PDOC00005 PS00005 PKC_PHOSPHO_SITE
Protein kinase C phosphorylation site

Number of matches: 11

1 53-55 SEK

2 73-75 SLK

3 199-201 SWR

4 283-285 TQR

5 411-413 SRR

6 589-591 TPR

7 602-604 TGR

8 611-613 SIR

9 686-688 SNR

10 760-762 SER

11 796-798 SKR



[3] PDOC00006 PS00006 CK2_PHOSPHO_SITE
Casein kinase II phosphorylation site

Number of matches: 8

1 81-84 SLPE

2 199-202 SWRE

3 208-211 TPPE

4 283-286 TQRD

5 500-503 TRGD

6 522-525 SLGD

7 589-592 TPRD

8 611-614 SIRD



[4] PDOC00008 PS00008 MYRISTYL 
N-myristoylation site 

Number of matches: 18 

1 10-15 GLLGAG 

2 145-150 GSWVSI 

3 323-328 GTSFSG 

4 358-363 GVASSI 

5 404-409 GLNFSN 

6 422-427 GMGSCL 

7 475-480 GAQCAS 

8 532-537 GQAVCM 

9 555-560 GAQPAA 

10 571-576 GNAFGS 

11 583-588 GSYVSC 

12 596-601 GQLQCQ 

13 624-629 GTELNC 

14 637-642 GSDVAQ 

15 670-675 GAQECR 

16 682-687 GVCDSN 

17 714-719 GLLLSL 

18 774-779 GTKASA 



[5] PDOC00016 PS00016 RGD 
Cell attachment sequence 

501-503 RGD 

[6] PDOC00021 PS01186 EGF_2
EGF-like domain signature 2

690-701 CYCEEGWAPPDC



[7] PDOC00129 PS00142 ZINC_PROTEASE
Neutral zinc metallopeptidases, zinc-binding region signature



362-371 SIAHELGHSL 
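
The motif hits listed above are position-specific pattern matches against SEQ ID NO: 2. As a rough illustration of how such coordinates arise, the sketch below scans a short fragment of the protein for the PROSITE PS00001 consensus N-{P}-[ST]-{P}, written here as a regular expression. The helper name `find_sites` is hypothetical. A lookahead is used because two of the reported N-glycosylation sites (406-409 and 409-412) overlap, and a plain regex scan would skip the second.

```python
import re

# PS00001 consensus N-{P}-[ST]-{P}; the zero-width lookahead allows overlapping hits.
N_GLYC = re.compile(r"(?=(N[^P][ST][^P]))")


def find_sites(pattern, fragment, offset=1):
    """Return (begin, end, motif) tuples in 1-based protein coordinates.

    `offset` is the sequence position of the first residue of `fragment`.
    """
    return [
        (m.start() + offset, m.start() + offset + len(m.group(1)) - 1, m.group(1))
        for m in pattern.finditer(fragment)
    ]


# Residues 401-420 of SEQ ID NO: 2, copied from the listing.
fragment = "FLPGLNFSNCSRRALEKALL"
sites = find_sites(N_GLYC, fragment, offset=401)
# sites == [(406, 409, "NFSN"), (409, 412, "NCSR")]
```

The two tuples returned for this fragment correspond to matches 2 and 3 of the N-glycosylation list.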



Membrane spanning structure and domains:

Helix  Begin  End  Score  Certainty
1      25     45   1.602  Certain
2      144    164  0.925  Putative
3      317    337  1.237  Certain
4      430    450  0.768  Putative
5      547    567  0.601  Putative
6      640    660  1.243  Certain
7      711    731  2.394  Certain



BLAST Alignment to Top Hit:

>CRA|335001098640323 /altid=gi|7451525 /def=pir||G02390 disintegrin
and metalloproteinase MDC15 (EC 3.4.24.-) - human
/org=human /taxon=9606 /dataset=nraa /length=814
Length = 814 

Score = 1714 bits (4390), Expect = 0.0 

Identities = 812/855 (94%), Positives = 812/855 (94%) 

Frame = +1 

Query: 76    MRLALLWALGLLGAGSPLPSWPLPNIALLSIPSVLSWGVLGPAGGTEEQQAESEKAPREP 255
             MRLALLWALGLLGAGSPLPSWPLPNI                 GGTEEQQAESEKAPREP
Sbjct: 1     MRLALLWALGLLGAGSPLPSWPLPNI-----------------GGTEEQQAESEKAPREP 43

Query: 256   LEPQVLQDDLPISLKKVLQTSLPEPLRIKLELDGDSHILELLQNRELVPGRPTLVWYQPD 435
             LEPQVLQDDLPISLKKVLQTSLPEPLRIKLELDGDSHILELLQNRELVPGRPTLVWYQPD
Sbjct: 44    LEPQVLQDDLPISLKKVLQTSLPEPLRIKLELDGDSHILELLQNRELVPGRPTLVWYQPD 103

Query: 436   GTRVVSEGHTLENCCYQGRVRGYAGSWVSICTCSGLRGLVVLTPERSYTLEQGPGDLQGP 615
             GTRVVSEGHTLENCCYQGRVRGYAGSWVSICTCSGLRGLVVLTPERSYTLEQGPGDLQGP
Sbjct: 104   GTRVVSEGHTLENCCYQGRVRGYAGSWVSICTCSGLRGLVVLTPERSYTLEQGPGDLQGP 163

Query: 616   PIISRIQDLHLPGHTCALSWRESVHTQTPPEHPLGQRHIRRRRDVVTETKTVELVIVADH 795
             PIISRIQDLHLPGHTCALSWRESVHTQTPPEHPLGQRHIRRRRDVVTETKTVELVIVADH
Sbjct: 164   PIISRIQDLHLPGHTCALSWRESVHTQTPPEHPLGQRHIRRRRDVVTETKTVELVIVADH 223

Query: 796   SEAQKYRDFQHLLNRTLEVALLLDTFFRPLNVRVALVGLEAWTQRDLVEISPNPAVTLEN 975
             SEAQKYRDFQHLLNRTLEVALLLDTFFRPLNVRVALVGLEAWTQRDLVEISPNPAVTLEN
Sbjct: 224   SEAQKYRDFQHLLNRTLEVALLLDTFFRPLNVRVALVGLEAWTQRDLVEISPNPAVTLEN 283

Query: 976   FLHWRRAHLLPRLPHDSAQLVTGTSFSGPTVGMAIQNSICSPDFSGGVNMDHSTSILGVA 1155
             FLHWRRAHLLPRLPHDSAQLVTGTSFSGPTVGMAIQNSICSPDFSGGVNMDHSTSILGVA
Sbjct: 284   FLHWRRAHLLPRLPHDSAQLVTGTSFSGPTVGMAIQNSICSPDFSGGVNMDHSTSILGVA 343

Query: 1156  SSIAHELGHSLGLDHDLPGNSCPCPGPAPAKTCIMEASTDFLPGLNFSNCSRRALEKALL 1335
             SSIAHELGHSLGLDHDLPGNSCPCPGPAPAKTCIMEASTDFLPGLNFSNCSRRALEKALL
Sbjct: 344   SSIAHELGHSLGLDHDLPGNSCPCPGPAPAKTCIMEASTDFLPGLNFSNCSRRALEKALL 403

Query: 1336  DGMGSCLFERLPSLPPMAAFCGNMFVEPGEQCDCGFLDDCVDPCCDSLTCQLRPGAQCAS 1515
             DGMGSCLFERLPSLPPMAAFCGNMFVEPGEQCDCGFLDDCVDPCCDSLTCQLRPGAQCAS
Sbjct: 404   DGMGSCLFERLPSLPPMAAFCGNMFVEPGEQCDCGFLDDCVDPCCDSLTCQLRPGAQCAS 463

Query: 1516  DGPCCQNCQLRPSGWQCRPTRGDCDLPEFCPGDSSQCPPDVSLGDGEPCAGGQAVCMHGR 1695
             DGPCCQNCQLRPSGWQCRPTRGDCDLPEFCPGDSSQCPPDVSLGDGEPCAGGQAVCMHGR
Sbjct: 464   DGPCCQNCQLRPSGWQCRPTRGDCDLPEFCPGDSSQCPPDVSLGDGEPCAGGQAVCMHGR 523

Query: 1696  CASYAQQCQSLWGPGAQPAAPLCLQTANTRGNAFGSCGRNPSGSYVSCTPRDAICGQLQC 1875
             CASYAQQCQSLWGPGAQPAAPLCLQTANTRGNAFGSCGRNPSGSYVSCTPRDAICGQLQC
Sbjct: 524   CASYAQQCQSLWGPGAQPAAPLCLQTANTRGNAFGSCGRNPSGSYVSCTPRDAICGQLQC 583

Query: 1876  QTGRTQPLLGSIRDLLWETIDVNGTELNCSWVHLDLGSDVAQPLLTLPGTACGPGLVCID 2055
             QTGRTQPLLGSIRDLLWETIDVNGTELNCSWVHLDLGSDVAQPLLTLPGTACGPGLVCID
Sbjct: 584   QTGRTQPLLGSIRDLLWETIDVNGTELNCSWVHLDLGSDVAQPLLTLPGTACGPGLVCID 643

Query: 2056  HRCQRVDLLGAQECRSKCHGHGVCDSNRHCYCEEGWAPPDCTTQLKATSSLTTGLLLSLL 2235
             HRCQRVDLLGAQECRSKCHGHGVCDSNRHCYCEEGWAPPDCTTQLKATSSLTTGLLLSLL
Sbjct: 644   HRCQRVDLLGAQECRSKCHGHGVCDSNRHCYCEEGWAPPDCTTQLKATSSLTTGLLLSLL 703

Query: 2236  VLLVLVMLGASYWYRARLHQRLCQLKGPTCQYRAAQSGPSERPGPPQRALLARGTKASAL 2415
             VLLVLVMLGASYWYRARL QRLCQLKGPTCQYRAAQSGPSERPGPPQRALLARGTK
Sbjct: 704 VLLVLX^LGASYWYRARDCQRLCQLKGPTCQYRMQSGPSERPGPr^RALU^G^ 759 

Query: 2416  SFPAPPSRPLPPDPVSKRLQSQGPAKPPPPRKPLPADPQGRCPSGDLPGPGAGIPPLVVP 2595
                                 SQGPAKPPPPRKPLPADPQGRCPSGDLPGPG GIPPLVVP
Sbjct: 760   --------------------SQGPAKPPPPRKPLPADPQGRCPSGDLPGPGPGIPPLVVP 799

Query: 2596  SRPAPPPPTVSSLYL 2640
             SRPAPPPPTVSSLYL
Sbjct: 800   SRPAPPPPTVSSLYL 814 (SEQ ID NO: 4)

Hmmer search results (Pfam): 

Scores for sequence family classification (score includes all domains):

Model    Description                                   Score  E-value
PF01421  Reprolysin (M12B) family zinc metalloproteas  259.3  5.3e-74
PF01562  Reprolysin family propeptide                  128.4  2.1e-35
PF00200  Disintegrin                                    70.0  3.4e-22
CE00385  E00385 platelet_aggregation_activation_inhib   26.5  5.4e-06
PF00035  Double-stranded RNA binding motif               7.2  1.2
CE00423  E00423 stromelysinJL                            4.5  0.99
PF01400  Astacin (Peptidase family M12A)                 2.6  7.8



Parsed for domains:

Model    Domain  seq-f  seq-t     hmm-f  hmm-t     score  E-value
PF01562  1/1       100    217 ..      1    119 []  128.4  2.1e-35
PF01400  1/1       363    373 ..     91    101 ..    2.6  7.8
CE00423  1/1       364    375 ..    222    233 ..    4.5  0.99
PF01421  1/1       230    428 ..      1    200 [.  259.3  5.3e-74
CE00385  1/1       447    518 ..      1     67 [.   26.5  5.4e-06
PF00200  1/1       447    523 ..      1     76 []   70.0  3.4e-22
PF00035  1/1       734    766 ..      1     37 [.    7.2  1.2

1 TTGGGTGACC CTGGGCAGTG ATCACATCTC CAAGCATCAG TTTTCTCACC
51 TGAAAAAAAG GAGATGATAA TAACACTATC TGCCTTACAT GACAATTGAA 
101 TTGAATTTTT TTTTTTTTTT TGAGACTAAG TCTCACTCTG TCGCCCAGGC
151 TGGAGTGCAG TGGCGTGATC TTGGCTCACT GCAACCTCCA CCTCCCCAGT 
201 TCAAGCGATT CTCGTGCCTC AGCTTCCCGA GTAGCTGGGA TTACAGGCAC 
251 ACACTACCAC GCCCGGCTAA TTTAGAATTG AAATAATTTA TGTACAGTAT 
301 CTTAGTACAG GACCTGACAT TATAAACAAT GAGTGGCAGC CATTCTTATT 
351 TAATCAGTCC TAACAAAGTT CATAAAAGTG AGACTGTGTT TGCTTAGCTT 
401 TTTCCCTAGG GCCTGGATAC CCCCAGCCCC CATGACACAC AATAGGGGCC 
451 AAATGAATGT GTTGTGAAAA AATGAAAAAC AAAAAACAAA AAAGAACATG 
501 CTGGGATTCC TTGACAGGGT CGTGAAGCAA ACTGAATGTG AATGCACAGA 
551 TGGAAATGTG CCAGACAGTC ATTCCAAGCA GAATGTGCAA AGACTCAGTC 
601 CACAGGGAAT GCGAAGTGCC AGGGCTAGTC TCAGGAGAAA CTGGCTCAGA 
651 AGAGACAGCT CTCAGGGAGG GCTAAAGTAG GAAAGAGGCT AGAAAGGGAC 
701 CAGGTGAGGG AAGGCTCTGA AGGCCAAGCC CAAGAGTTCT GCCTGTCTGG 
751 CAGGCAGCAG GGCCTCTGGA GTTTCTTGGG CAAAGAGTGG CTGCTTCCTG 
801 GGTAAGGTGG CCTGTGGAAA ATCCCTGACA ACTGTGTAGA GACATGTCGT 
851 GAGGGATGGC AGGGAGCATA GTGAACTAGG TTTGTGGTTT GGAATCAGGG 
901 CCCCTGGGGT CCAGCCAAGT TGGATTGTTT ACTATCTGTG TGACTTTGAG 
951 AGTCACTTCA CCTTTCTCAA CTGTAAAGTG GGGATAGCAA CAGTGATAGT 
1001 CGATCTGGCC TGCTCACTTC TCAGCCTCAC TGTGAGAACC AAATAAGATG 
1051 ATTTACAGGA AAGTGCAAAT GAGAGTTGTG GCTGATATCC GCTTGGAGAG 
1101 AGCCTGGAGG GTGCATCCTC CCATTCTCCA TCACAGAGTT GGGGAGGGAG 
1151 GCACCCTCGC CCTCCAGGGG TTTCCTTTGT CCAACCCAGC CTCCTCCAAC 
1201 ACGCGGGAAT TGTCAGGCCT GGCGACTTCA GACAGGAAAC GCTGTCCAGT 
1251 TCCCCTTCTT TCCCGCCTCG CTCCCGGGCT GGCGCTAACG CCCACCTCCC 
1301 AACAGCGCCA CCCGCTGGCG GATATCCTGC ACCGCGGCTG CCCGCTCCTG 
1351 CGCCGCTGGC TGTGCCGGCG CTGCGTGGTG TGCCAGGCAC CCGAGACGCC 
1401 CGAGTCCTAC GTGTGCCGGA CGCTGGACTG CGAGGCCGTG TACTGCTGGT 
1451 CGTGCTGGGA CGACATGCGG CAGCGGTGCC CGGTCTGCAC GCCCCGCGAA 
1501 GAGCTCTCTT CCTCCGCCTT TAGTGACAGC AACGACGACA CTGCCTACGC 
1551 GGGGTGAAGA GGCGTCCTGC TCGCTCTTCC GCACCGTCCT TCCCGGTTAA 
1601 TAAAATGCCC TGTACGCTTC ACGTGGGTCG GGGACTGGGG TGAGCCGCGC 
1651 ACTGCCTCGC CTGCAGTCGG GAAAGCCTGC CCGCCCGACC TCTCCGAGCC 
1701 AGGCCGCGCA CAGGAGGCAG GGAGGCCGCG AAGCTACTAG GGAGGGGTCC 
1751 GGACCTGGCG CCGGGTGAAG GCGCGCCGCC CAAGCCGGTC GGACCGGGCA 
1801 CCGGCTCCCA CTCCGCACAG TTGCGGGGAA GCGGTAGCGC TGAGCAGCGC 
1851 GGGCGTAGTG GGCGGTGTCC CCGCTCCCGA GGCACCCGGC GCGCAGCGGG 
1901 GCGGGCTTTG CCGGGGGCGG AGCTTGGCTT GGGGCCGGGT GGGAGGGGGC 
1951 GGGCCGGGGC GGGGCCTGGT GGCCGCGCGG CGCTGCTGGG TTCTCCGAGG 
2001 CGACCTGGCC GCCGGCCGCT CCTCCGCGCG CTGTTCCGCA CTTGCTGCCC 
2051 TCGCCCGGCC CGGAGCGCCG CTGCCATGCG GCTGGCGCTG CTCTGGGCCC 
2101 TGGGGCTCCT GGGCGCGGGC AGCCCTCTGC CTTCCTGGCC GCTCCCAAAT 
2151 ATAGGTGAGT CCTCCGCCTG GAGTGGGTCG GGGGGCGGAC TGGGAGGGAG 
2201 GTGCAGGAAA GTCGGAAGGC ATTAGGGTAA TGGGGCCGGA CGGAGACCCT 
2251 GGGAGAGCCC AGCCAGAGCG CGGCCCGCCC TGGTCCGCTG TCCTGGGCCT 



FIGURE 3A 




2301 AGGGCCCGGT GACTTGGCGA TGQGGTGAAA AGAGAAGGAG GGGGGATGCC 
2351 GGCGCCCCCT GCCTCCTGCC TGGTCATCCT CTGCGCGGTC CCTGCGGACA 
2401 CTTTCAGGCT CAGGTACCAG GTACCGAGGG GCCTGTCCAG CGCCACTTCA 
2451 AGATCGTGAT GAGAGGGTCG CTGCTCCCCA GGACTGGCAT CTTCGCTGCT 
2501 CTGGGGCCTA GCTAACCGTT CCACCCGGTG CCAGGGCGCT GAGCGGGCAT 
2551 GGCTTGTAGG GTTTAGTGAA GAGGATTCTC TCTAGCCTCT ATTCCAGGCC 
2601 TGGGGCCGCC AGGCACTCCT CACCCTGGTG CTGTTGCCAC CAGTGCCTGG 
2651 CCGAGCGGGA GGGGCCCGAG ATGAGCCAGG AGAAGGGAGA ATTGGCCAGG 
2701 AAAGAGGCTG GGACACCAAC TCCTCCTTGG AACTTTCACT TCCCGCTGCT 
2751 GTCTTGGGCC GGGACCGAGA GGGCAGGCGC GGGTGGAGTG TCCGGAGGAG 
2801 AGAGGGCCAT TGTGTGTTGG GGGGGTGGGG GGTGCTCGAG GAGGAAGCAG 
2851 AGGCTGTAGG CAGCGGGTGT GCCTGACTGG GCATGAGGGT GTTTAGGGAG 
2901 GTGGGGGTGT TTGCACTGCT CACCCAGAAA TGGGCGTTCC TGGCATCTCC 
2951 GATGTGAGCG AAGGGGAGGG TGAGCGGGCA CCCGGCCACA AGGCTTAGCT 
3001 CAGTCTCGAG AGGGGGCGTT CCTGAAGTGG GGGGAGAGTG ATTGGGAGGG 
3051 AGTGGGAACC GCGGAGGGTC CTGTGAGAAC CTGGGATTGG CCGGAAGGGG 
3101 ACAAGGAGGG CCACAGGCTG CGCAAGCCGA AAGTCTTTCT TGGGGACTTG 
3151 TGAATGGGTT GGTGGGTGGA AAGCCATAAA TTAGAGAGAC ACCCTCTCCT 
3201 TCCAGTATTC TTCTTTAAGT CTCAGCATGC AATGTGGAAG CCCCTCAGGT 
3251 ACCTAAGGGT CTTGATGGGC TGGGAGCTGG TGGATCTGAG GGCACCTGTC 
3301 ACCCCCAGCC CTGCTGTCGA TTCCCTCAGT ACTGTCTTGG GGTGTCCTGG 
3351 GACCTGCAGG TGGCACTGAG GAGCAGCAGG CAGAGTCAGA GAAGGCCCCG 
3401 AGGGAGCCCT TGGAGCCCCA GGTCCTTCAG GACGATCTCC CAATTAGCCT 
3451 CAAAAAGGTG CTTCAGGTGA GCTCTCACTC CCCTCTAATA AATAAACGAA 
3501 TCCACACACG CCCCGGTATA GCCAGGTGTC TCAAAGCCAA AGCTTGGCTG 
3551 AGGAGCTGGT GGGTAGAGCT CACTGTAGTG GGTCTATCCC AGGCCCAGCT 
3601 GCCTCTCCCA CCACACCCCA GCACCTGGCT TCACTTATCT CCCTCTCCCT 
3651 CTGCACACAC GTGTATCTGT CTGCCTCAGC CCCACCCAAC CCATCCATCT 
3701 CCACTGGGGA AATTGTGAAG CCAAACTTGC 1 1 IU I CATC TCATGTTGTC 
3751 GGTTTTCTCA GTGGGGGGAT TTGGAAAGAG TCAGGACCTT ACCAAACCCC 
3801 CCCCCCCCAC CCCATTCTAA AGCTGAGTCA GAGGAAGGGC TGGGGCTTGT 
3851 GCTGGGTCCT ACACGGTGCT TCCTCTCTGG GCAGGAAGCC GAGAAGGGGT 
3901 GGCTCAGATA CCTTCCTTGA CCTCCGCACA CAACCCCCCA GAACAATGCT 
3951 CCAGGCCAGG CAGGGTTTCC TGGCCCCTCC CCTGGGATCC CCCCACCAGT 
4001 GATCTAATTG CTGGTGCTCT TCTGTGGGCC TGAGGTTTTC TGGTTAGAGA 
4051 GGCTGGGAGT TGTGGACAGG TCTAGGGAGG TGACCTGCCC TCTGGTGCCC 
4101 ACAGACCAGT CTGCCTGAGC CCCTGAGGAT CAAGTTGGAG CTGGACGGTG 
4151 ACAGTCATAT CCTGGAGCTG CTACAGAATA GGTAATAGTG ATGGTGGCAA 
4201 TAACAGTGAC CACATGGCCA ACAACTTGTA TAGCATTTAT TATGTGCCAG 
4251 GTACTAAGTG CTTGTGCTCA TTTAATCCTC ATAACAGCCC TATAAGGGAT 
4301 ATACTATCAT GTATTATTGT CCTCACTTTA TACATGAGGA AGTCAAGGCA 
4351 CAGAGAGATT AAATAACTTG CCCCAGGTCA CACAGCTAGT ATGTGGTGAA 
4401 AACCAGATTG GAATTCAAAT AAACTAACAG AGTCAGTGGC CCAACCAGTA 
4451 TACTTTGCTG CCCCGGGGTC AGGAGTGGAA AAGTTGGCTG CGGGGGTTGC 
4501 CTGGTCCCCA GCCCCACAAC CACCTTCAAG CCTCTGCTTG TCAATGCACC 
4551 GACCCTGGGA AGTGGCTTTA GCACTGCCTT CTTTTTCTTC ACTTCACAGG 
4601 GGAGTTGGTC CCATGTCCGC CCCGACCCTT GGGGTCCGGC WCCCCTCT
4651 CCCCCGTCG GCGCCGCCCC TTCCCTTTTC TTTCTTCCCC TCCGCTTTCG
4701 TCCTTTTGCC TCCCCCGTGC CGTTGCGCGT TCCTTCTTCC CCGTTCCCTC
4751 TCCCCTCTTT TGTTCCCTCC CGTTCTTTTC TCCCCCGCGT TCTTTCCTCC
4801 TCCTTTTCGG TCCGCCCTCG CCTTCCTCCC TTCCCCTTCT GCCCTTCGCC
4851 NTTTCTCCCT CTCGTTCTTC CTCGGTGTCG CGTCGTCCCG GCTCGGCCTT
4901 TCCCCGCTTC CTCCCGCTCG CCGTTTTTTT CCCCCCGCTG TCTTCCCGTG
4951 TTCCCCTTCG CTTCTCCTCT TCCCTTTCGT TCGGTCGTTT TCTCGTTCCA 
5001 TTCCCGCCTC CCCGTTTCCG TTCCACTCCT TCTTCCTCCT TTCCCGCTCC 
5051 CCGnTCTCC CGACCCCAAC AACAAATAAA NNNNNNNNNN NNNNNNNNNN 
5101 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
5151 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
5201 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
5251 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
5301 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
5351 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
5401 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
5451 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNTCAGG 
5501 AGGCCGAGTG GAAGAATCGC TTGAGCCCAG GTAGGCAGAG GTTTCAGTGG 
5551 GCCGAGATCG AGCCACTACA CACCAGCCTG GGTGAAAGAG TGAGACCTCG 
5601 TCTCAAAAAA TAAAATAAAA ATAAAATAAA ATAAAATCTA GCTGAGACAG 
5651 ATTAGGTGGT TTGCCCGAGG CCCTACAACT AATAAATGGC CTATCCATTT 
5701 ATTAGTTGTA TTTGGCTCTT CATCTGTCTT ATGATCCCAT TTGCAGAGAG 
5751 CTCTCACTTG GTTATAGATA ATACATAGTT ACCAATGATG AAGCAATATA 
5801 AACCCAATTT CCTAATTTGT AAAATGAAGA TAATAAAACT ACTTGCTGCA 
5851 TAGAGTTGCT GGGAAGATTA AATAAGTCCA TATAGATGTA AAGTGOTAA 
5901 AACTATGCCA GACCTATGGT AAGTGACAAG AGTTGTTATT GGGATTTTA
5951 AAATTATTAT TATTATTATT ATTATTATTT GAGACAGAGT CTCGCTCTGT 
6001 CTCCCAGGCT GGAGTGCAGT GGCGTGATCT CGGCTCACTG CAAGCTCCGC 
6051 CTCCCAGGTT CACGCCATTC TCTTGCCTCA GCCTCCCGAG TAGCTGGGAC 
6101 TACAGGCGCC CGCCACTACA CCCGGCTAAT GTTTTGTATT TTTTAGTACA
6151 GACAGGGTTT CACCGTGTTA TCCAGGATGG TCTCGATCTC CTGACCTCAT 
6201 GATCCACCCG CCTTGTCCTC CCAAAGTGCT GAGATTACAG GCGTGAGCCA 
6251 CCGCACCCAG CTAAATTACT GTTTTTTAAA AATTTGAAAA AAACCACTGA
6301 GTTTGGAGCC AGAAAAGCAG GGGTCTACTC CAACCTTCAT TATCTACTTC 
6351 CTGGTCCTCC TTGGCAAGTT CCTGGGCCCT CTGGCCTTCA GTGGCTCATC 
6401 TGTAAAATGG GCTCTTCACC CTCCTATTTG ACCCACAGAG TAGGAGTGGC 
6451 TGCCTCTTGG TCAGCCCGGC ACAGCTGCTG GCTGCGAGCG GCAGGTTTGC 
6501 CTGATAATTC TTCTTGTCCA TAGTAGAGGC GGGATGTGGT AACAGAGACC 
6551 AAGACTGTGG AGTTGGTGAT TGTGGCTGAT CACTCGGAGG TGAGCCTGCT 
6601 GGCCCCTGCA CATCCTCCTC CCCCTGCACT GCCCTGCCGC CTTTCATGTC 
6651 ACCTCTCTTG GCCTACAGGC CCAGAAATAC CGGGACTTCC AGCACCTGCT 
6701 AAACCGCACA CTGGAAGTGG CCCTCTTGCT GGACACAGTG AGTGCTGGAC 
6751 AGGGCAACCC CCACCCCAGG CCCCTGACCA TGGCAACCCC TCTTCTGAGC 
6801 CCCAGCTGTC TTTCAGTTCT TCCGGCCCCT GAATGTACGA GTGGCACTAG 
6851 TGGGCCTGGA GGCCTGGACC CAGCGTGACC TGGTGGAGAT CAGCCCAAAC 



FIGURE 3C 




6901 CCAGCTGTCA CCCTCGAAAA CTTCCTCCAC TGGCGCAGGG CACATTTGCT 
6951 GCCTCGATTG CCCCATGACA GTGCCCAGCT GGTGACGTAA GGGCCCCAGA
7001 CTCAGCCAGA GAGGCCAGTC CTGTCCTGQC CAAATTCACA CCCCTTCAGC 
7051 ACCCTACCTC AQCCCCTGAA GCTCTGACCA CCGTGGCTTC TGGCCCTGAA 
7101 CTTTAGCCTC TCTGTCCCAC AGTGGTACTT CATTCTCTGG GCCTACGGTG 
7151 GGCATGGCCA TTCAGAACTC CATCTGTTCT CCTGACTTCT CAGGAGGTGT 
7201 GAACATGGTG AGTTATTTCC AGGTCTCCTC CTCATTCCCA ATTCAGTTCC 
7251 TCCCAAGTGT GGTGGCATTT ATGCACTGAA ACCCCCCTAT AAAGTTGCCC 
7301 AACCCCAAAG CTACAGGTAT AGAGGGTGGA GGTACGTGAT GTGGCCTTTG 
7351 CTATCAGGGA GCCCTCGCTT ATGGCCAGCT AGTCACAGTG TACACAGTCA 
7401 TCCCCTGTGC AGTCTTCCCA TTTCTTAGAG GAGGGTAGGA GGCAGCTAAG 
7451 GCCCAAAGAA CAGAGGTGAT CTCCCTCCAG TGAGGGAGGG GGACAGAGCT 
7501 GAGCTAGAAC CCAAGTTTCT GCCATCCAGG CCTGGGTTCT CCTACTTTAG 
7551 AAGCAATTCA GGAGGGAAGC AGTGCCTGCT GAGTGCCCAC GAGGTCAGAC 
7601 GTGGAGGGAA CAGGAGCAGA GAGGGTGGTC TGGGCATTGT GGTGGAGGCA 
7651 GGCTGGGACT GGACCTACAG TACCCCTCCC CAATGACAGG ACCACTCCAC 
7701 CAGCATCCTG GGAGTCGCCT CCTCCATAGC CCATGAGTTG GGCCACAGCC 
7751 TGGGCCTGGA CCATGATTTG CCTGGGAATA GCTGCCCCTG TCCAGGTCCA 
7801 GCCCCAGCCA AGACCTGCAT CATGGAGGCC TCCACAGAGT AAGTAGCTGC 
7851 AGGATGGAGA GAGGGTGTGG GGCAGGGGGC AGGGANNNNN NNNNNNNNNN 
7901 NNNNNNNNNN TGTTAGAGTT ACCTTCCTTG CCACCCTCCC CAGCTTCCTA 
7951 CCAGGCCTGA ACTTCAGCAA CTGCAGCCGA CGGGCCCTGG AGAAAGCCCT 
8001 CCTGGATGGA ATGGGCAGCT GCCTCTTCGA ACGGCTGCCT AGCCTACCCC 
8051 CTATGGCTGC TTTCTGCGGA AATATGTTTG TGGAGCCGGG CGAGCAGTGT 
8101 GACTGTGGCT TCCTGGATGT GAGCCCCTTT CCCAAAGCCT CGCCCCACTC 
8151 ACTTCTGTAC CCTCACCCTG GCTCATTAGC CCTATCCCAG CCTCCTGAGC 
8201 TCTTGGGTTC TGAAGGGACT TTCCACCCCT CTCCTACTTG CCCTGTCTGT 
8251 GGGGACAGCA CATGGGTTGT TGGGCTCTAG CCCTCGCTTG CTGTGTAGCT 
8301 TCTGGTCTTG GCCTGTGGGA GGAGGAGAGA TTGGAGGGAG GCTCACAGGC 
8351 CCCACCTGCT CTGATGCCCG GCCCCCGTGC TCCTGCCCAC AGGACTGCGT 
8401 CGATCCCTGC TGTGATTCTT TGACCTGCCA GCTGAGGCCA GGTGCACAGT 
8451 GTGCATCTGA CGGACCCTGT TGTCAAAATT GCCAGGTGGG TAGAGACTAG 
8501 ACTGGCCACC CGGAGCTCAC CTGCCGGGGC CAAGGTGGAA AGGGTCATTC 
8551 TGACCCCCGG CTGGATTTGC TCAGTGCCCA CACTGATGCT CATCCACCCT 
8601 CCACAGCTGC GCCCGTCTGG CTGGCAGTGT CGTCCTACCA GAGGGGATTG 
8651 TGACTTGCCT GAATTCTGCC CAGGAGACAG CTCCCAGTGT CCCCCTGATG 
8701 TCAGCCTAGG GGATGGCGAG CCCTGCGCTG GCGGGCAAGC TGTGTGCATG 
8751 CACGGGCGTT GTGCCTCCTA TGCCCAGCAG TGCCAGTCAC TTTGGGGACC 
8801 TGGAGCCCAG CCCGCTGCGC CACTTTGCCT CCAGACCGCT AATACTCGGG 
8851 GAAATGCTTT TGGGAGCTGT GGGCGCAACC CCAGTGGCAG TTATGTGTCC 
8901 TGCACCCCTA GGTAAGTGAG GAAACCTGGC TCCTCCTTTG GGTTTCTGAG 
8951 AGCCTTGGCC CTGCTCCTAC TAACTCTGTG TGCCCTTCCC CCTCNNNNNN 
9001 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNTTACGG 
9051 CATTTGTAGT TACTCACACT TTTGCCTTCA NACAGCTAAT ACTCGGGGAA 
9101 ATGCTTTTGG GAGCTGTGGG CGCAACCCCA GTGGCAGTTA TGTGTCCTGC 
9151 ACCCCTAGGT AAGTGAGGAA ACCTGGCTCC TCCTTTGGGT TTCTGAGAGC 
9201 CTTGGCCCTG CTCCTACTAA CTCTGTGTGC CCTTCCCCCT CCCCACAGAG
9251 ATGCCATTTG TGGGCAGCTC CAGTGCCAGA CAGGTAGGAC CCAGCCTCTG 
9301 CTGGGCTCCA TCCGGGATCT ACTCTGGGAG ACAATAGATG TGAATGGGAC 
9351 TGAGCTGAAC TGCAGCTGGG TGCACCTGGA CCTGGGCAGT GATGTGGCCC 
9401 AGCCCCTCCT GACTCTGCCT GGCACAGCCT GTGGCCCTGG CCTGGTGAGC 
9451 AGCCTGGGTG GGCAAGACCA GGTGTGAGAA GGGACATTTG GACCACAATG 
9501 AACAGAGCCC AGACTTCACC ATTCACCAAT GTCAAAGGCA GGGACTCCAA 
9551 GGGAAGTCAG TTTCTTACTT CAGATGGAGC AAAGTCCTAT CAACTCACTA 
9601 TGCCTTGGTT TCCCCATCTG TAAACGCAGG GTATGGCCTC AACCTTATTG 
9651 GCCTCCCAGT CCCATTAAAG CTTTGTGGGA ATCTGATCCA GGCTCTTCTC 
9701 TCCCTGGGTC AGGTGTGTAT AGACCATCGA TGCCAGCGTG TGGATCTCCT 
9751 GGGGGCACAG GAATGTCGAA GCAAATGCCA TGGACATGGG GTGAGCTGGG 
9801 ATGGGGGAAG TGGAAGGGGA GCAGAGAGCC TCTAGAGAGG AAAAGGATAC 
9851 TGGGCTTTGG AAATAGACAT ATCTGGGTTT TAATCCTTGC TCTACTACTT 
9901 CCCAGTTGTG TGACCTCGGG CAGGTTACTA ACTTTGCTGA GCTCAGTTTC 
9951 CCCACCTATC AAATGGCTAT AATAATAGTA TCCCCATCCA GGGTACATGA
10001 GATGTGTATG CAAGCAAGTA GCACAGTGGG TAACTAATAG TGCTTTTAAA 
10051 AANNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
10101 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
10151 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
10201 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
10251 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
10301 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
10351 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
10401 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
10451 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
10501 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
10551 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
10601 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
10651 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
10701 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
10751 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
10801 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
10851 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
10901 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
10951 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
11001 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
11051 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
11101 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
11151 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
11201 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN
11251 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
11301 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
11351 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
11401 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
11451 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
11501 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN
11551 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
11601 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
11651 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
11701 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
11751 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
11801 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
11851 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
11901 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
11951 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
12001 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
12051 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
12101 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
12151 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
12201 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
12251 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
12301 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
12351 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
12401 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
12451 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
12501 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
12551 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
12601 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
12651 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
12701 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
12751 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
12801 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
12851 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
12901 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
12951 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
13001 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
13051 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
13101 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
13151 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
13201 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
13251 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
13301 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
13351 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
13401 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
13451 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
13501 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
13551 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
13601 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
13651 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
13701 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
13751 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
13801 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN
13851 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
13901 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
13951 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
14001 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
14051 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
14101 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
14151 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
14201 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
14251 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
14301 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
14351 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
14401 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
14451 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
14501 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
14551 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
14601 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
14651 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
14701 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 
14751 NNNNNNNNNN NNNNNNNNNN NNNNNNTTTT TGAAAGCTAC TAGTAGGTCA 
14801 CCATTTTTTC TTGTCTTCCC GCAATCCAGA CCAGCGCCAC CGCCTCCGAC 
14851 AGTGTCCTCG CTCTACCTCT GACCTCTCCG GAGGTTCCGC TGCCTCCAAG 
14901 CCGGACTTAG GGCTTCAAGA GGCGGGCGTG CCCTCTGGAG TCCCCTACCA 
14951 TGACTGAAGG CGCCAGAGAC TGGCGGTGTC TTAAGACTCC GGGCACCGCC 
15001 ACGCGCTGTC AAGCAACACT CTGCGGACCT GCCGGCGTAG TTGCAGCGGG 
15051 GGCTTGGGGA GGGGCTGGGG GTTGGACGGG ATTGAGGAAG GTCCGCACAG 
15101 CCTGTCTCTG CTCAGTTGCA ATAAACGTGA CATCTTGGGA GCGTTCCCCA 
15151 GAGTTTGTCT GCTTCTAGAA CCCGGGTCGC TCCTGCTGCG GTTCCAGGTT 
15201 TGGCCGCCAG AAGACGCTGC CGCCTCAGAC GAGGGCGGGC TGTGTGGGGC 
15251 GGGAGTACCA GAAAGGGTCG GCGTGTGTCC CCGGGATGCT CGCAGCTTCC 
15301 CTCTGCCCAG ACTGGGGTGG CTTTCGGCGC AATCTGTCAA GCTGTTGGAC 
15351 CTGCCGTCCC CACTCTGACC ATTGGCTGGG AAAAGTGGAT CTGGCTGATG 
15401 CTCCCAGAGC CCAGGAGCCA GGGCGGAGCG GGGCGGCGGC TGCTCCCACG 
15451 ATCCCAAGGC CGCGCACCTG CCTCCTCCCC CTCCGCCGCC GCCACTTGAG 
15501 GGATCGGGAA CAAAGGTGCT TTGTACAGGC CGCAACCACC TCATTACTTC 
15551 GTCTTAGGGA CTGGGGCCGC GTGGGCCCCC AGCCCGGAAC GAAGGTGTGG 
15601 AGCGGCAAGG GACAGACGCC AATCTTAAAG TGAGCATCTA GCGCGCCACC 
15651 TAAGGCTCTT TAGGGAAGGT GGTCCCAGAG CTGTGTTGTC CCTTCCGCTT 
15701 GCACTGTCCC TAGATGTGCA AAGAAAACGG GGCAGTGCAT GAAGGTGGTT 
15751 GGACAGGCTT CATGGATCCT CGCCCGCGCC TCACTTTCCC CTATCTGGGC 
15801 AAAGGTTATG TACCCTTATT TAAAATCTTC CAAACTTCTA ATAAGGCAGT 
15851 CTACCCTGCA CTAAAGCAGA CACGAAAGAG ATGACCTCCC TAAAAATACT 
15901 GCTGTTGGAA TACGTCCTTC CTTCCCGCCC CCTCGCAGTG CGGTGCAGCC 
15951 TCAGTGGAAG CTTTGGCGAA CCTGGCGCGC GCTGCGGTGC ACAGAGGGTT 
16001 AACTGGAGTT GGCGCTGGGT GGAGAGGAGG AGACGCGCTC CCATTGGCGG 
16051 AAAGTTATTC AGGGGCGGGG TCAGTGAATC TCCGTACCCC ACTCCCCTTT 
16101 CCGCAACTTC CCTCTTCACT TTGTACCTTT CTCTCCTCGA CTGTGAAGCG 
16151 GGCCGGGACC TGCCAGGCCA GACCAAACCG GACCTCGGGG GCGATGCGGC 
16201 TGCTGCCCCT GCTGCGGACT GTCCTATGGG CCGCGTCCTC GGCTCCCCTC 
16251 TGCGCGGGGG CTCCAGCCTC CGCCACGTAG TCTACTGGAA CTCCAGTAAC 
16301 CCCAGGTAGC CGGGCCGAAC CGGGCGAGCG CACAGCCAAG TCTGCGCGCT 
16351 CCCGGGCTTT GCGCGCGCCC GCCACCCGCT CTTTGCGCGG CGCCGCCTGA 
16401 GCCTGGCCGC GCGCCGGGGC TCCTTTGTTT GAGCCGGCGG GGGAGGGGGG 
16451 AGGGGCGAGG GGCGAGGCGC GCCCTGGGTC TCCCCACAGC CCGCATGTGT 
16501 TGGGGGGCAG GCAGAAGACC CCAGCCCCAA GGGTTGTCTA GGGGGTCTTG 
16551 GAGCATGGAG CTGGGGGGGC CTTTGCCCGC ACTCCGGGCT CCGCCCCCCT 
16601 CGCTGCTCTC CTGGCGATCC CCAGCCTCCC GCAGGCTGGA GCTGTGGCTG 
16651 ACGAACTTGA GAGCGAGGGA GGGGGCTTTA CTCTTATGAA AGAGCGTGGG 
16701 TTACTCTCCT GCCCGCTGGG TCTCACCTCT GGCTCTCACT CTGTCTCCTG 
16751 ATCTCATTTG CTATCTCTGC TTTCATCTCT GTCTTTATTG GTCCTTCTGT 
16801 TTCTTTCCAG TGTCAGCCCT GCCCTTCTAG CCGAATCACC TCTGGGCAAG 
16851 TCTCGTGACC TTCCTAACCT CATTTATCTC ACCTGTATAA TGGGCTAATA 
16901 ATACCTAGTA CCCTGGGAAG TCTGGCAGGG TAAGTGAGGT CATGTATGTG 
16951 AAAGAGGCTC AGGCTGTACA GATATAAACT ATTATTTCTT TCTCTCTCCT 
17001 GAGCTGCCTG CCTTTGAACC TTAGTATATT TTACTGTTTC CATCCCCCTC 
17051 CCCAAGTCTC CCTGCCTCTC CTATTTCCTA TCTGTTTTTC TTTCTGATTT 
17101 TCTACTTGAG ACAATCTGTG ACTATTCATT TCTTCACT 
(SEQ ID NO: 3) 
DNA 

Position 

2522 TTAGGGTMTGGGGCCGGACGGAGACCCTGGGAGAGCCCAGCCAGAGCGCGGCCCGCCCT 
GGTCCGCTGTCCTGGGCCTAGGGCCCGGTGACTTG^ 

GGGGATGCCGGCGCCCCCTGCCTCCTGCCTGGTCATCCTCTGCGCGGTCCCrGCGGAG\C 

mCAGGCTCAGGTACCAGGTACCGAGGGGCCTGTCCAGCGCCACTTCMGATCCT 

AGAGGGTCGCTGCTCCCCAGGACTGGCATCTTCGCTGCTCT 

[C,G] 

ACCCGGTGCCAGGGCGCTGAGCGGGCATGGCTTGTAGGGTTTAGTGMGAGGATTCT 
TAGCCTCTATTCQVGGCCTGGG^ 

GTGCCTGGCCGAGCGGGAGGGGCCCGAGATGAGCCAGGAGMGGGAGMTTGGCC^ 
AGAGGCTGGGACACCAACT CCTCCTTGGMCTTTC^CTTCCCGCrGCrCTCTTGGGCCGG 
GACCGAGAGGGCAGGCGCGGGTGGAGTGTCCGGAGGAGAGAGGGCCATTGTGTGTT 
GGGCCTGAGGTTTTCTGGTTAGAGAGGCrGG^ 
TGCCCTCTGGTGCCCACAGACCAGTC^^ 

CGGTGACAGTCATATCCrGGAGCTGCT ACAGAATAGGTAATACTGATGGTGGG\ATAACA 
GTGACCACATGGCCAACAAGTGT ATAGC^rTTATTATGTGCCAGGTACTMGTGCTTOT 
GCTCATTTMTCCrCATMCAQCCCrATMGGGATATACT ATCATGT ATTATTGTCCTCA 
[C,T] 

TrTATACATGAGGMGTCMGQCACAGAGAGATTAMTMCTT^ 
TAGTATGTGGTGAAMCCAGATTGGMTTCA^ 
AGTATACTTTGCTGCCCCGGGGTCAGG^ 
CCCAGCCCCACMC<^CCTTCMGC^^ 

TTTAGCACTGCCI ICI 1 1 1 ICI I CACTTCACAGGGGAGTTGGTCCCATGTCCGCCCCGAC 

AGGTGGTTTGCCCGAGGCCCT ACAACT AATAMTGGCCTATCG\TTTATTAGTTGTATTT 
GGCTCTTCATCTGTCTTATGAT^^ ATAGATAATA 
CATAGTTACCMTGATGAAGG\ATATAMCCCMTTTCCTM 
TAAMCTACTTGCTGCATAGAGTTGCrGGGMGA^ 

TGGTAAAACTATGCCAGACCrATGGTAAGTGACAAGAG I IGI IATTGGGAI I 1 1 IAAAA 
[T,-] 

TATTATTATTATTATTATTATTAT^ 

TGCAGTGGCGTGATCTCGGCTCACTGCAAGCTCCGCCTCCCAGGTT^ 
GCCTCAGCCTCCCGAGT AGCTGGGACT ACAGGCGCCCGCCACTACACCCGGCTAATGTTT 
TGTAI I I I I I AGTACAGAG\GGGTTTCACCGTGTT ATCCAGGATGGTCTCGATCTCCTGA 
CCTCATGATC(^CCCGCCTTGTCCTCCCAMGTGCTGAGATTAC^ 

TGCGAGCQGCAGGTTTGCCTGATAATTL I ICI I GTCCATAGTAGAGGCGGGATGTGGTAA 

CAGAGACCAAGACTGTGGAGTTGGTGATTGTGGCTGATCACTCG^ 

CCCCTGCA<^TCCTCCTCCCCCTGCACTGCCCTGCCGCCTTTCATGTC^^ 

CTAG^GGCCCAGAMTACCGGGACTTCCAGCACCTGCTAMCCGCACACT 

CTCTTGCTGGACACAGTGAGTGCTGGACAGG 

[G,A] 

CMCCCCTCrrCTGAGCCCCAGCTGTCTTTCAGI ICI I CCGGCCCCTGAATGTACGAGTG 

GCACTAGTGGGCCTGGAGGCCTGGACC^ 

GCTCTCACCCTCGAAMCTTCCrCCACTGGCGCAGGGCACATrrGCT 

CATGACAGTGCCCAGCTGGTGACGTM^ 

TCCTGGCCAMTTCACACCCCTrCAGCACCCTACCTCAGCCC 

TATirCCAGGTCTCCTCCrCATTCCCMTTCACTTCCTCCCMCT 

CACTGAMCCCCCCTATAMGTTGCCCM^ 

ACGTGATGTGGCCTTTGCTATCAGG 

ACAGTCATCCCCTGTGCAGTCTTCCCAI I ICI I AGAGGAGGGTAGGAGGCAGCT AAGGCC 

CAAAGMCAGAGGTGATCTCCCTC^CT 

[A,C] 

GTTTCTGCCATCCAGGCCrGGGTTCrCCrACTTT AGMGCAATTGVGGAGGGAAGCAGTG 

CCTGCTGAGTGCCCACGAGGTCAGACGTGGAGGGMCAGGAGCAGAGAG 

CATTGTGGTGGAGGCAGGCTGGGACTGGACCTACAGTACCCCTCCCCM 



FIGURE 3J 



Docket No.: CL001177DIV2 
Serial No.: (to be assigned) 
Inventors: Gennady MERKULOV et al. 
Title: ISOLATED HUMAN METALLOPROTEASE... 

CTCCACCAGCATCCTGGGACTCGCCT 

CCTGGACCATGATTTGCCTGGGAATAGCTGCCCCTGTCCAGGTCCAGCCCCAGCCAAGAC 

15505 CGCCAGAAGACGCTGCCGCCTCAGACGAGGGCGGGCTGTGTGGGGCGGGAGTACCAGAAA 
GGGTCGGCGTGTGTCCCCGGGATGCTCGCAGCTTCCCTCTGCCCAGACTGGGGTGGCTTT 
CGGCGCAATCTGTCAAGCTGTTGGACCTGCCGTCCCCACTCTGACCATTGGCTGGGAAAA 
GTGGATCTGGCTGATGCTCCCAGAGCCCAGGAGCCAGGGCGGAGCGGGGCGGCGGCTGCT 
CCCACGATCCCAAGGCCGCGCACCTGCCTCCTCCCCCTCCGCCGCCGCCACTTGAGGGAT 
[C,T] 

GGGAACAAAGGTGCTTTGTACAGGCCGCAACCACCTCATTACTTCGTCTTAGGGACTGGG 
GCCGCGTGGGCCCCCAGCCCGGAACGAAGGTGTGGAGCGGCAAGGGACAGACGCCAATCT 
TAAAGTGAGCATCTAGCGCGCCACCTAAGGCTCTTTAGGGAAGGTGGTCCCAGAGCTGTG 
TTGTCCCTTCCGCTTGCACTGTCCCTAGATGTGCAAAGAAAACGGGGCAGTGCATGAAGG 
TGGTTGGACAGGCTTCATGGATCCTCGCCCGCGCCTCACTTTCCCCTATCTGGGCAAAGG 

16123 AAATCTTCCAAACTTCTAATAAGGCAGTCTACCCTGCACTAAAGCAGACACGAAAGAGAT 
GACCTCCCTAAAAATACTGCTGTTGGAATACGTCCTTCCTTCCCGCCCCCTCGCAGTGCG 
GTGCAGCCTCAGTGGAAGCTTTGGCGAACCTGGCGCGCGCTGCGGTGCACAGAGGGTTAA 
CTGGAGTTGGCGCTGGGTGGAGAGGAGGAGACGCGCTCCCATTGGCGGAAAGTTATTCAG 
GGGCGGGGTCAGTGAATCTCCGTACCCCACTCCCCTTTCCGCAACTTCCCTCTTCACTTT 
[A,G] 

TACCTTTCTCTCCTCGACTGTGAAGCGGGCCGGGACCTGCCAGGCCAGACCAAACCGGAC 
CTCGGGGGCGATGCGGCTGCTGCCCCTGCTGCGGACTGTCCTATGGGCCGCGTCCTCGGC 
TCCCCTCTGCGCGGGGGCTCCAGCCTCCGCCACGTAGTCTACTGGAACTCCAGTAACCCC 
AGGTAGCCGGGCCGAACCGGGCGAGCGCACAGCCAAGTCTGCGCGCTCCCGGGCTTTGCG 
CGCGCCCGCCACCCGCTCTTTGCGCGGCGCCGCCTGAGCCTGGCCGCGCGCCGGGGCTCC 

Chromosome map: 
Chromosome # 1 